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CONTROL OF GENE TRANSCRIPTION 

The present invention relates to a gene which encodes a 
factor required for transcription in plant cells and a 
method for the control of transcription using that gene. 
Development, form and function of plants are brought 

5 about by the timed and co-ordinated expression of many 

thousands of genes. These genes are expressed in specific 
tissues, at different times, for different periods of time, 
in response to different end signals. The activation and 
repression of gene expression is controlled through the 

10 interaction of protein factors with specific DNA sequences 

associated with genes. Factors controlling the initiation of 
transcription of protein coding genes by RNA polymerase II 
(B) can be divided into two groups, general factors and 
activators. General transcription factors are responsible 

15 for assembly of the pre-initiation complex at the 'TATA' 
boxr and accurate basal level transcription initiation. 
Activators are an heterogeneous class of sequence-specific 
DNA binding proteins that interact with the pre-initiation 
complex to bring about regulated high level transcription of 

20 the associated gene. Interaction of activators with the 

pre-initiation complex is presumed to be an important level 
of control for gene expression. 

Assay systems for the analysis of transcription 
initiation in-vitro have been described for animals and 

25 plants. These contain general transcription factors and RNA 
polymerase II derived from specific organs and cell types, 
and have been used in the analysis of both cis-sequences and 
activators. An important feature of such systems is the 
higher level regulated transcription which can be obtained 
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in the presence of added activator protein r indicating that 
the activator interacts with one or several components of 
the general transcription machinery. In animal and yeast 
systems many of these general factors have been identified 

5 and purified. From such studies several consistent features 
of the number and type of factors have emerged, as has their 
order of assembly on the DNA. The most important of the 
general factors is Transcription Factor IID (TFIID), 
necessary for template commitment (as it binds to the TATA 

10 box) which may interact with at least one activator protein 
via a 'coactivator' . Following the binding of TFIID, other 
factors (TFIIA/G, TFIIB, RNA polymerase 2 and TFIIE/F) are 
recruited to form the fully assembled complex. In addition 
to TFIID, TFIIB has recently been shown to directly interact 

15 with activator proteins in animal systems* Recent analysis 
of TFIID from Hela cells has revealed that the endogenous 
factor is composed of a single DNA-binding protein (the 
TATA-binding protein, TBP) and several other unrelated 
components. TFIID TBP-component is composed of both 

20 conversed and species-specific elements. Homology between 
TBPs is at least 80% or greater in the carboxy-terminal 200 
amino acids. The amino terminal ends of TBPs are highly 
disparate, and were initially presumed to play some role in 
species-specific interactions that regulate transcription 

25 initiation* Although it is possible that this does occur 
other evidence has pointed to functional differences being 
localised to the highly conserved carboxy-terminal region. 
Reports to date have focused on analysis of the physical 
requirements of protein structure for TBP function. 

30 An object of the present invention is to provide means 

for controlling gene transcription in plants. 

According to the present invention there is provided a 
cDNA having the sequence given in Figure 1 herewith and 
variations therein permitted by the degeneracy of the 
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genetic code, and any equivalent genomic sequenc encoding 
TFIID TBP-component to which said cDNA hybridises. 

The invention further provides the cDNA located on 
plasmid pTFIID which has been deposited, in an Escherichia 

5 coli , strain DH5a, host, under the terms of the Budapest 

Treaty on the Deposit of Microorganisms for Patent Purposes, 
with the National Collection of Industrial and Marine 
Bacteria, Aberdeen, united Kingdom, on 19th December 1991 
under the Accession Number NCIB 40467. 

10 Also, the invention provides a method of inhibiting 

the growth of a plant cell comprising stably incorporating 
into the genome of the said plant by transformation a full 
or partial length copy of a nucleotide which is antisense to 
the cDNA defined hereinabove or its genomic equivalent. 

15 Such inhibition may also be obtained by stably 

incorporating into the genome of the said cell by 
transformation a full or partial length copy of the cDNA 
defined hereinabove or its genomic equivalent. 

Further according to the invention there is provided a 

20 method for the isolation of the TFIID promoter, comprising 
probing a genomic library with the cDNA of the invention, 
isolating a genomic sequence which hybridises to the said 
cDNA and recovering from the isolated genomic sequence th 
promoter region lying upstream of the TFIID gene. 

25 The invention thus also provides the promoter sequence 

of the TFIID gene. 

Because the TFIID is consti tutively expressed, the 
invention also provides a method for the induction of 
constitutive gene expression, comprising placing a select Id 

30 gene of interest under control of the promoter of the TFIID 
gene . 

Therefore, the invention further provides a recombinant 
gene construct comprising in sequence a TFIID promoter, a 
selected gene for expression and a transcription terminator 
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region. 

The TFIID gene can be used as a target to inhibit 
growth of a plant cell by several methods. In addition the 
gene sequences can be used to isolate the gene in question 

5 which is a truly constitutive gene. TFIID is a DNA-binding 
protein or factor which binds to DNA sequences found on 
plant promoters such as the "TATA box", interaction of 
other factors involved with the initiation of transcription 
or the enhancement of transcription with the factor is 

10 specifically required for gene expression in plants. 

The cDNA can be used to isolate the gene encoding it* 
This is described in detail below. This gene is special in 
that due to the requirement that it be expressed in all 
tissues of a plant at all times, it is expressed 

15 constitutively . Therefore the promoter controlling it is 
must be a truly constitutive promoter. 

In addition, inhibition of the expression of this gene 
in specific tissues or at a specific stage during plant 
development is expected to lead to the inhibition of plant 

20 development at that given time. This can be used to inhibit 
flowering, pollen formation, embryo development and seed 
formation among others. 

The present invention also provides a plant having 
stably incorporated in its genome by transformation a gene 

25 construct carrying a tissue-specific or a development- 
specific promoter which operates in the cells of the target 
plant tissue and a DNA binding protein gene construct which 
is capable, when expressed, of inhibiting transcription in 
the cells of the said target tissue resulting in death of - 

30 the cells. 

One preferred application of this invention is in the 
creation of male sterile plants for use in the production of 
hybrids. Our International Patent application WO 90/00110, 
incorporated herein by reference, relates to the production 
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of such male sterile plant lines. Essentially our inv ntion, 
the subject of that International Application, is a plant 
gene construct comprising a disrupter gene encoding a 
protein which is capable of disrupting the biogenesis of 

5 viable pollen, and a gene regulatory sequence and which 
includes a promoter sequence inducible by external 
application of an exogenous chemical inducer to a plant 
containing the construct. In a specific embodiment, the 
construct comprises: 

10 (a) a first gene promoter sequence responsive to the 

presence or absence of an exogenous chemical inducer , 

(b) a gene encoding a repressor protein under control 
of the said first promoter sequence; 

(c) an operator sequence responsive to the said 

15 repressor protein expressed by the repressor protein gene; 

(d) a male flower specific gene promoter sequence 
expressible only in male parts of a plant; and, 

(e) a gene encoding a disrupter protein capable of 
disrupting biogenesis of viable pollen; 

20 whereby the presence or absence of the exogenous 

chemical inducer in the plant enables selection of male 
fertility or sterility* 

Examples of the disrupter protein are; the mammalian 
uncoupling protein (UCP) gene, a mutated form of the gene 

25 for the p-subunit of Fj^-ATPase which has sequences added or 
deleted such that these changes result in the retention of 
the ability to assemble with other subunits but interfere 
with function as an ATP synthase, a mutated, synthetic form 
of the oli 1 gene encoding subunit 9 of the F^-ATPase, a * 

30 mutated form of a mitochondrial transit pre-sequence which 
malfunctions during transfer resulting in the disruption of 
protein transport to mitochondria, and, a gene construct 
carrying a fusion between the g-subunit gene from yeast and 
the e-glactosidase gene from E. coli , resulting in 
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expression of a disrupting fusion protein. 

The present invention provides an alternative means of 
disrupting pollen formation comprising inhibition of the 
TFIID gene. 

5 The invention also provides a plant, particularly a 

monocotyledonous plant, and more particularly a corn plant, 
having stably incorporated within its genome a gene 
construct carrying a tissue-specific promoter which operates 
in the cells of the said target gene constructs may be used 

10 as a means of inhibiting cell growth in a range of organisms 
from simple unicells to complex multicellular organisms such 
as plants and animals. By the use of tissue- or 
cell-specific promoters, particular cells or tissue may be 
targeted and destroyed within complex organisms. One 

15 particular application could be the destruction of cells 
essential for male flower development, leading to male 
sterility or the inhibition of seed development leading to 
plants which do not carry seeds. 

The invention therefore provides a method of preveating 

20 or inhibiting growth and development of plant cells based on 
gene constructs which inhibit transcription and gene 
expression. The technique has wide application in a number 
of crops where inhibition of particular cells or tissue is 
required. 

25 Of particular interest is the inhibition of male 

fertility in maize for the production of Fl hybrids in situ. 

The method of transformation is not particularly 
germane to this invention and any method suitable for the 
target plant may be employed. Transgenic plants are 

30 obtained by regeneration from the transformed cells. 
Numerous transformation procedures are known from the 
literature such as agroinf ection using Agrobacterium 
tumefaciens or its Ti plasraid, electroporation, 
microinjection of plant cells and protoplasts. 
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micropro jectile transformation and pollen tube 
transformation, to mention but a few. Reference may be made 
to the literature for full details of the known methods. 
The invention will now be described in following 
5 Examples. 

EXAMPLE 1 

cDNA library synthesis and screening 

A cDNA library was constructed in Lambda gt-11 using 
polyadenylated RNA obtained from young potato ( Solanum 

10 tuberosum L.cv. Desiree) tubers according to the 

manufacturers recommendations (Ainersham). The cPNA library 
(10^ recombinant phage) was screened with a TBP cDNA clone 
from Arabidopsis thaliana L, and the inserts from 
hybridising clones were sub-cloned into pUBS-3. The 

15 complete DMA sequence of cDNA inserts was determined on both 
strands by sequencing of plasmid UNA, and DNA and derived 
protein sequence was analysed using Mac Vector DNA sequence 
analysis software. 

A TBP cDNA clone from Arabidopsis thaliana L was used 

20 to screen a cDNA library from potato tubers • From an 

initial screen of 10^ clones from an amplified portion of 
this library one clone was isolated ( ST-1 ) , and from a 
subsequent screen of 10^ clones from the primary library a 
further three clones were isolated (ST-2, ST-3, ST-4). The 

25 complete nucleotide sequence of ST-2 (representing the 
largest spliced cDNA insert) was determined (Figure 1), 
This DNA sequence was compared to those available in the 
EMBL-database and indicated a high level of homology between 
ST-2 and TBPs from both human and yeast sources • 

30 The cDNA ST-2 contains an open reading from 

corresponding to a protein of molecular weight 22 kDa, 
similar to the molecular weights reported for TBP from 
Arabidopsis thaliana L. Comparison of this protein sequence 
with predicted TBP from other species shows that the protein 
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encoded by ST-2 has a high degree of similarity to the 
putative TBPs 1 and 2 from Arabidopsis thaliana L (Figure 
2), These proteins are of identical length (200 
amino-acids) , and are 94.5% and 94% homologous respectively 
5 to the potato protein. Potato TBP also shares over 80% 

homology with the carboxy-terminal region of TBP from 
humans, yeast and Drosophila melanogaster , 
EXAMPLE 2 

Production and analysis of in-vitro synthesised TBP; 

10 A nested set of 3' truncations of the TBP cDNA were 

constructed by digestion of plasmids with suitable 
restriction enzymes. Plasmid DNAs containing the 
full-length TBP cDNAs were linearised at the 3' end of the 
cDNA by restriction digestion. RNA was synthesised in vitro 

15 from T7 RNA polymerase promoter located 5' to the cDNA, 

purifiedr and used to program a rabbit reticulocyte in-vitro 
translation reaction. In-vitro synthesised protein was used 
in electrophoretic mobility shift assays (EMSAs), using an 
end-labelled 360 base-pair DMA fragment (position -340 to 

20 -1-20) from the potato patatin class 1 promoter (21) as a 

specific probe. Reactions were carried out in ID^l volumes 
containing 0.1^1 reticulocyte-lysate translation product as 
previously described (20) using radiolabelled probe DNA 
(290pM, 4000cpm) and 333ng (63.4nM) polydGdC.dGdC 

25 (Pharmacia) at 25°C for 30 minutes unless otherwise stated. 
Protein/DNA complexes were resolved on native 4% 
polyacrylamide gels. 

The functional requirements of cloned potato TBP were 
determined using in-vitro synthesised protein. pST-2 was * 

30 used to produce potato TBP protein in-vitro by transcription 
of the linearised plasmid and subsequent translation of RNA 
(Figure 3). Carboxy-terminal truncated versions of ST-2 
protein were produced following linearisation of plasmid DNA 
with restriction enzymes cutting within the predicted open 
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reading frame. ST-2 protein and truncated derivatives 
synthesised in this way were tested for DNA-binding 
specificity using the electrophoretic mobility shift assay 
(EMSAs 22,26). In-vitro translation of RNA from linearised 

5 plasmids gave proteins of the correct predicted molecular 

weights. The full length protein was capable of DNA binding 
to the potato patatin class-1 gene promoter. All truncati n 
derivatives of ST-2 TBP were however not able to support DNA 
binding or transcription enhancement, within a 10-fold range 

10 of protein concentration of that shown (data not shown). 
EXAMPLE 3 

Biochemical analysis of TBP DNA binding 

Double-stranded oligonucleotides used in competition 
experiments were of the following structure (top-strand 
15 shown): 

1. Wild type 5' tcgacTTGTTTACGTGCCTATATATACCATGCTTG 
TTATATGCTCg-3' 

2. mutant 5 '-tcgacTTGTTTACGTGCCTAGGGGTACCATGCTTG 
TTATATGCTCg-3' • 

20 Nucleotides which are shown in lower case letters 

indicate changes from the wild type patatin TATA sequence 
motif (underlined), and lower case nucleotides indicate and 
added restriction enzyme site. For heat stability 
experiments protein preparations were heat treated for 10 

25 minutes, and bound DNA was separated from free DNA at 

increasing times thereafter. The relative affinity ration 
of potato TBP for polydGdC.GdC and the patatin promoter was 
calculated as previously described . The Apparent 
Equilibrium constant t^^pp), Cq (the number of binding sites 

30 in the protein preparation(K and K (the specific and 
non-specific equilibrium constants) were calculated as 
previously described. [D^**] values (expressed as moles 
polydGdC.dGdC base-pairs per litre) were obtained from 
experiments in which the amount of polydGdC.dGdC was varied 
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and patatin promoter fragment kept constant • EMSAs were 
quantified by cutting out from the dried gel and Cher nkov 
counting. A standard calibration curve indicated that the 
relationship between input DMA and counted fractions was 
5 linear. All experiments were conducted at least three 

times, and at least two separate protein/RNA preparations 
were used. Results shown are the average of three separate 
experiments. 

Biochemical characteristics of plant TBP activity were 

10 analysed using the full-length ST-2 protein (Figure 4). 

Analysis of the physical properties of the ST-2 protein-DNA 
interaction indicated that maximum DNA binding occurred at a 
KCl concentration of 200mM, and MgClj of 5mM. DTT did not 
influence DNA binding capacity under optimised conditions 

15 (data not shown). TBP was shown to be heat-labile, more 

than 80% of the available activity being destroyed by a 10 
-minute incubation at 45*^0. Maximum DNA binding occurred 
between 25-30*Cr the apparent equilibrium constant after 30 
minutes incubation being reduced substantially at lower and 

20 higher temperatures. On and off-rate measurements for th 
protein-DNA complex indicated that both association and 
dissociation of TBP with DNA is slow (saturation for 
association occurring after 30-40 minutes, and only 20% TBP 
being dissociated after an additional 20 minutes in the 

25 presence of excess specific competitor DNA). 

The copolymer polyGdC.dGdC (used as non-specific DNA in 
EMSA experiments) was a very inefficient competitor for TBP 
binding to the patatin promoter (Figure 4). Specificity of 
DNA binding for the TATA box was demonstrated in competition 

30 experiments in which an oligonucleotide containing the wild 
type TATA motif from the patatin gene (see Figure 3) was 
shown to compete effectively with the patatin gene promoter 
for ST-2 binding, whereas an oligonucleotide containing a 
mutated TATATATA box (replaced with TAGGGGTA) was competed 
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at a very much lower level. The relativ affinity of the 
wild type TATA-box dna sequence was measur d with respect to 
the mutated sequence and polydGdC.dGd.C by analysis of the 
levels of competitor DNA that reduce TBP DNA binding 50% 

5 (relative affinity ratio; 22,26). Results indicate that 

potato TBP binds the copolymer polydGdC .dGdC with 
1.77xl0^-times lower affinity. 

The apparent equilibrium constant (K^^pp) under 
optimised reaction conditions for potato TBP binding to the 

10 patatin promoter TATA-box was calculated from the binding 

data (Figure 5d), and in four independent experiments gave a 
value 2.4x10^ m"^+1 . 1x10^^ . The values of Kg (the specific 
equilibrium constant), and the K^^ (the non-specific 
equilibrium constant) were calculated from Figure 5e and f, 

15 and were 5x10^ m"^ and 3.65x10^ m"^ respectively. 
EXAMPLE 4 

Isolation of the gene encoding TFIID 

In all non-plant organisms investigated so far TBP is 
encoded as a single gene. In contrast, TBP from Arabidopsis 

20 is present in at least two copies. To determine the 

representation of ST-2 TBP sequences in the potato genome, 
genomic southern experiment were performed with potato DMS. 
The probe (pst-2) hybridised strongly to a single fragment 
of potato DNA when using restriction enzyme Bgll and to two 

25 DNA fragments when using EcoRI . Very much weaker 

hybridisation is also visible to one additional DNA fragment 
in the Bgll digested DNA upon longer exposure of the filter 
to film. The clone ST-2 does not contain any EcoRI sites, 
and only a single Bglll site at position 35. Analysis f 

30 the sequence of an incomplete partially spliced potato TBP 

cDNA clone ST-1 did not reveal any additional EcoRl or Bglll 
restriction sites. These data suggest that in potato TBP is 
encoded in the genome as a low copy number sequence, 
possibly as a single gene per haploid genome. 
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A genomic library of potato was sere ned using the cDNA 
clone. A genomic clone can be isolated using this approach. 



wo 93/15213 



-13- 



PCT/GB93/00I37 



Inttmadonal ApollcAion No: PCT/ 



MICROORGANISMS 

Sh«M tm coffWMctiM wttn ifi« McrMrtwvM M •*» •m* — . liM—T. ^» — of I 



A. lOamiPICATtOH OP oiroMT* 



NAHM mt «Mo«unr 



NATIONAL COLLECTION OF INDUSTRIAL £ MARINE BACTERIA 



AMtma mi aipaaittf? mmnwmm ttncMMUit »m 

23 St. Machar Drive, 
Aberdeen AB2 IRY 
United Kinadom 


laicMaaMCM 








Oat* mt ••wwm * 

19th December 1991 


NCIMB 40467 


S. AOOmOMAk inmCATlOnB » Claa»aai 


ifttii Mt ■aaiiri 




L TMa mmmmmi i» caauawa mi a a* 


aarawanM 


nra«aiiaai ^ 



I • (H IM MMcauan* am mm <W W <— lr«»««* UMmI 



ALL STATES DESIGNATED ON THE REQUEST FOR GRANT 



\ • (laava Mana M fia« a»aii c a1a l 



TK* Matcatiana toal** «M aa aaawwiiaa ta iha NitaraaMaai ftvf aaa laiaf • (Spacity lha >i w ira* Mlava at ttia laaicauaita 

" Accaaaiaa MwMiaar at Oaaaa* **) 



t. Thi» •MM f*€«*vaa m«ttt in* latarnaiieaai aaebuuaa wiiaa Aiad lie ba chactaa ft? lha racaivtAf Omea) 

(Awiaantaa Offlearl 
n Tha aaia at racatai (iroin taa apaitcaiu) a? tha latanwiiaaai Aitfaaa 

waa *" • 

(Awiaaniaa Offlcar) 



wo 93/15213 



PCr/GB93/00137 



-14- 



CLAIttS 



1. A cONA having the sequence given in Figure 1 
herewith and variations therein permitted by the 
degeneracy of the genetic code, and any equivalent 
genomic sequence encoding TFIID TBP-component to which 
said cDNA hybridises. 

2. The cONA located on plasmid pTFZID which has been 
deposited, in an Escherichia coli , strain DH5a, host^ 
under the terms of the Budapest Treaty on the Deposit 
of Hicroorganisms for Patent Purposes, with the 
National Collection of Industrial and Marine Bacteria, 
Aberdeen, united Kingdom, on 19th December 1991 under 
the Accession Number NCZB 40467. 

3. A method of inhibiting the growth of a plant cell 
comprising stably incorporating into the genome of the 
said plant by transformation a full or partial length 
copy of a nucleotide which is antisense to the cONA 
claimed in claim 1 or its genomic equivalent. 

4. A method of inhibiting the growth of a plant cell 
comprising stably incorporating into the genome of the 
said cell by transformation a full or partial length 
copy of the cDNA claimed in claim 1 or its genomic 
equivalent. 
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A method for the isolation of the TFIID promoter, 
comprising probing a genomic library with the cOMA 
claimed in claim 1, isolating a genomic sequence which 
hybridises to the said cDNA and recovering from the 
isolated genomic sequence th promoter region lying 
upstream of the TFIID gene. 

The promoter sequence of the TFIID gene. 

A method for the induction of constitutive gene 
expression, comprising placing a selected gene of 
interest under control of the promoter claimed in claim 
6. 

A recombinant gene construct comprising in 
sequence a TFIID promoter r a selected gene for 
expression and a transcription terminator region. 

A plant gene construct comprising a disrupter gene 
encoding a protein which is capable of disrupting the 
biogenesis of viable pollen, and a gene regulatory 
sequence and which includes a promoter sequence 
inducible by external application of an exogenous 
chemical inducer to a plant containing the construct in 
which the said disruption of pollen biogenesis is 
achieved by the method claimed in claim 3 or claim 4^ 



wo 93/15213 



PCT/GB93/00137 



1/8 



FIG. 1 (1/2) 

ATATATATATTCATCCTTCCTTGTCGCTCTACAAAGATCTTCTTCAT 
ATTATTTCTTTACAACAAGCAAAATCCTCCAATTTCAGCAAACCCTA ! 
TAGGTCAAAAGAGGAATGGCAGATCAGGGATTAGAGGGGAGCCAGCC 

MADQGLEGSQP| 

• 

CAGAATATTGTCTCAACGGTTAATTTGGACTGCAAGTTGGACCTGAA 
QNIVSTVNLDCKLDLK 

■ 

CGTTTTGCTGCAGTGATCATGAGAATTAGAGAACCAAAAACTACAGC 
RFAAVIMRIREPKTTA 
F I R P K 

AGTGAACAACAGTCAAAGTTGGCAGCCCCGAAATATGCTAGAATCAT 
SEQQSKLAAPKYARII 

CAGAATATAGTTGGTTCTTGTGATGTTAAATTTCCTATTCGACTTGA 
QNIVGSCDVKFPIRLE 

GAACTATTTCCTGGATTAATATATCGCATGAAACAACCAAAAATAGT 
ELFPGLIYRMKQPKIV 

AAGGTTAGAGATGAGACATATACTGCCTTTGAGAACATATACCCAGT 
KVRDETYTAFENIYPV 

GACTGTGGAGCATCTGCTATGACTAACTGCAAGGGGTGGTTCAAAAT 
ATCAGGCATCGGCTTTTGTGGAAGGTTTGTTTAAGTTACAATTGATA 
TGTAGCATATCATAAAATTTCAATTTAGGACCAAAAAAAGCCAAACT 
CTTGATAGCGGCCTTTTGGTTTACTTTTCCTCAAAAAAAA 
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FIG. 1 [212) 



I ATAAi\AGAATCCTTCTTTCAAATTCTTTACCCGCTACAAATTC 90 

I. GXTTTAGATTTTCAGTTTAAAGAGTTTTGTGTACGGAAGGGAC 180 
AGTGGATCTTACCAAGCACCCTTCTGGAATAGTCCCTACTCTG 27 0 
. VDLTKHPSGIVPTL 25 

AGCTATTGCACTTCAAGCTCGAAATGCAGAGTACAATCCGAAG 360 
AIALQARNAEYNPK 55 

(143) Y P 

GTTG ATTTTCGCTTCTGGG AAAAT6GTTTGT ACTGGAGCC AAA 450 
LIFASGKMVCTGAKflS 
LIF S G K V T G A K 

TCAAAAGCTTGGTTTTCCAGCCAAGTTTAAGGATTTTAAAATC 54 0 
QKLGFPAKFKDFKI 115 

AGGCCTTGCATATGCCCACGGTGCTTTCTCAAGTTACGAGCCA 63 0 
GLAYAHGAFSSYEP 145 

GCTGCTT ATTTTTGTTTCTGGGAAAATTGTC ATC ACAGG AGCC 720 
LLI FVSGKIVITGA 175 

I TCTTACCGAGTTCAGGAAGAATCAGCAATGAATTGATTTGATG 810 
LTEFRKNQQ 200 

TTGG AG ACTTGTTTGTTTTTTCTT AATTGT AGAT ATTCTAT AA 900 
GGTTGGAAGATTAGCAAGATGGGGATAACTGTGTTACTTCACC 99 0 
GGTC ATGGGTTCC AAATTTATTGTACCTGTTGAGTCT ATAATG 1080 

1120 
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FIG. 5 
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